Method for run time hardware code profiling for algorithm acceleration

نویسندگان

  • Vladimir Matev
  • Eduardo de la Torre
  • Teresa Riesgo
چکیده

In this paper we propose a method for run time profiling of applications on instruction level by analysis of loops. Instead of looking for coarse grain blocks we concentrate on fine grain but still costly blocks in terms of execution times. Most code profiling is done in software by introducing code into the application under profile witch has time overhead, while in this work data for the position of a loop, loop body, size and number of executions is stored and analysed using a small non intrusive hardware block. The paper describes the system mapping to runtime reconfigurable systems. The fine grain code detector block synthesis results and its functionality verification are also presented in the paper. To demonstrate the concept MediaBench multimedia benchmark running on the chosen development platform is used.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Profiling Tools for FPGA-Based Embedded Systems: Survey and Quantitative Comparison

Profiling tools are computer-aided design (CAD) tools that help in determining the computationally intensive portions in software. Embedded systems consist of hardware and software components that execute concurrently and efficiently to execute a specific task or application. Profiling tools are used by embedded system designers to choose computationally intensive functions for hardware impleme...

متن کامل

Adding Hardware Support to the HotSpot Virtual Machine for Domain Specific Applications

Like real general-purpose processors, Java Virtual Machines (JVMs) need hardware acceleration for computationally intensive applications. JVMs however require that platform independence can be maintained while resorting to hardware acceleration. To this end, we invented a scheme to seamlessly add hardware support to Sun’s HotSpot JVM. By means of run-time profiling, we select the most heavily u...

متن کامل

Run-Processing: A Coherence-oriented Processing Method and its Hardware Architecture for Real-time Video Object Segmentation

A coherence-oriented run-processing method and its hardware architecture is proposed for multiple widely-used techniques in video object segmentation algorithms including connected component labeling, binary morphology, seeded region growing and bitwise operations of binary masks. Experimented result shows that this run-domain processing method allows significant computation acceleration, memor...

متن کامل

Execution Time Minimization Based on Hardware/Software Partitioning and Speculative Prefetch

This report addresses the problem of minimizing the average execution time of an application, based on speculative FPGA configuration prefetch. Dynamically reconfigurable systems (like FPGAs) provide both the performance of hardware acceleration and the flexibility and adaptability that modern applications require. Unfortunately, one of their main drawbacks that significantly impacts performanc...

متن کامل

Accurate Profiling and Acceleration Evaluation of the Smith-waterman Algorithm Using the Molen Platform

In this paper, we present an accurate method to evaluate the amount of acceleration gained by the hardware implementation of the Smith-Waterman algorithm. This is done using the MOLEN Processor Prototype (MOLEN platform), where algorithms can be executed both as software or as reconfigurable hardware. By profiling the algorithm, we identify a specific function that consumes 78% of the total run...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009